Perception Score: A Learned Metric for Open-ended Text Generation Evaluation

نویسندگان

چکیده

Automatic evaluation for open-ended natural language generation tasks remains a challenge. We propose learned metric: Perception Score. It utilizes pre-trained model and considers context information conditional generation. Score assigns holistic score along with the uncertainty measurement. conduct experiments on three two unconditional tasks. achieves state-of-the-art results all consistently in terms of correlation human scores.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Affect Detection from Open-Ended Improvisational Text

We report progress on adding affect-detection to a program for virtual dramatic improvisation, monitored by a human director. We have developed an affect-detection module to control an automated virtual actor and to contribute to the automation of directorial functions. The work also involves basic research into how affect is conveyed through metaphor. The relevance of the project to the sympos...

متن کامل

Machine Translation Evaluation Metric for Text Alignment

As plagiarisers become cleverer, plagiarism detection becomes harder. Plagiarisers will find new ways to obfuscate the plagiarized passages so that humans and automatic plagiarism detectors are not able to point them out. So, a plagiarism detection system needs to be robust enough to detect plagiarism, no matter what obfuscation techniques have been applied. Our system attempts to do the same b...

متن کامل

ترجمه قسمتی از کتاب a text book for midwives

چکیده ندارد.

15 صفحه اول

Closing in on open–ended patient questionnaires with text mining

Knee injury and Osteoarthritis Outcome Score (KOOS) is an instrument used to quantify patients' perceptions about their knee condition and associated problems. It is administered as a 42-item closed-ended questionnaire in which patients are asked to self-assess five outcomes: pain, other symptoms, activities of daily living, sport and recreation activities, and quality of life. We developed KLO...

متن کامل

Exploitation In Affect Detection In Open-Ended Improvisational Text

We report progress on adding affectdetection to a program for virtual dramatic improvisation, monitored by a human director. We have developed an affect-detection module to control an automated virtual actor and to contribute to the automation of directorial functions. The work also involves basic research into how affect is conveyed through metaphor. The project contributes to the application ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2021

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v35i14.17526